Extending HeidelTime for Temporal Expressions Referring to Historic Dates

نویسندگان

  • Jannik Strötgen
  • Thomas Bögel
  • Julian Zell
  • Ayser Armiti
  • Tran Van Canh
  • Michael Gertz
چکیده

Pattern / Normalization Resources •1–4 digit years • “BC/AD” phrases Rules •several modifications and new rules • rules with explicit “BC/AD” phrases • rules without “BC/AD” phrases Normalization Strategies •calculations for relative and underspecified expressions •disambiguation of 2-digit years •expressions without “BC/AD” phrases ∗preliminary normalized to AD dates ∗ if explicit BC dates in documents, then check for chronology Evaluation

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime

In this paper, we describe the development of French resources for the extraction and normalization of temporal expressions with HeidelTime, a open-source multilingual, cross-domain temporal tagger. HeidelTime extracts temporal expressions from documents and normalizes them according to the TIMEX3 annotation standard. Several types of temporal expressions are extracted: dates, times, durations ...

متن کامل

Chinese Temporal Tagging with HeidelTime

Temporal information is important for many NLP tasks, and there has been extensive research on temporal tagging with a particular focus on English texts. Recently, other languages have also been addressed, e.g., HeidelTime was extended to process eight languages. Chinese temporal tagging has achieved less attention, and no Chinese temporal tagger is publicly available. In this paper, we address...

متن کامل

Temponym Tagging: Temporal Scopes for Textual Phrases

For many NLP and IR applications, anchored temporal information extracted from textual documents is of utmost importance. Thus, temporal tagging – the extraction and normalization of temporal expressions – has gained a lot of attention in recent years and several tools such as HeidelTime and SUTime are proposed. However, such tools do not address textual phrases with temporal scopes like “Clint...

متن کامل

Tuning HeidelTime for identifying time expressions in clinical texts in English and French

We present work on tuning the Heideltime system for identifying time expressions in clinical texts in English and French languages. The main amount of the method is related to the enrichment and adaptation of linguistic resources to identify Timex3 clinical expressions and to normalize them. The test of the adapted versions have been done on the i2b2/VA 2012 corpus for English and a collection ...

متن کامل

HeidelTime: Tuning English and Developing Spanish Resources for TempEval-3

In this paper, we describe our participation in the TempEval-3 challenge. With our multilingual temporal tagger HeidelTime, we addressed task A, the extraction and normalization of temporal expressions for English and Spanish. Exploiting HeidelTime’s strict separation between source code and languagedependent parts, we tuned HeidelTime’s existing English resources and developed new Spanish reso...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014